##       year           caseid           state            age
##  Min.   :1998   1.031956019:    36   Min.   : 1.00   Min.   :  0.00
##  1st Qu.:2000   1.038900463:    36   1st Qu.:12.00   1st Qu.: 19.00
##  Median :2003   1.013900463:    35   Median :27.00   Median : 26.00
##  Mean   :2004   1.011122685:    34   Mean   :27.23   Mean   : 34.85
##  3rd Qu.:2006   0.961122685:    33   3rd Qu.:42.00   3rd Qu.: 48.00
##  Max.   :2010   0.993761574:    33   Max.   :56.00   Max.   :100.00
##                 (Other)    :150951                   NA's   :2762
##       sex          D_injury    D_airbagAvail  D_airbagDeploy
##  Min.   :1.00   Min.   :0.00   no  :45087     no  :78480
##  1st Qu.:1.00   1st Qu.:1.00   yes :98839     yes :51582
##  Median :2.00   Median :3.00   NA's: 7232     NA's:21096
##  Mean   :1.51   Mean   :2.47
##  3rd Qu.:2.00   3rd Qu.:4.00
##  Max.   :2.00   Max.   :5.00
## NA's :879
This histogram shows that people aged 15-25 are involved in the largest number of accidents.
This histogram shows that the number of accidents per year is decreasing.
Females slightly outnumber males in the dataset.
The majority of the cars were equipped with airbags and were still involved in a fatal accident.
The airbags were not deployed in the majority of the accidents, which suggests that airbags play an important role in safety.
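The two airbag observations can be checked against the counts printed in the summary output at the top. The following is a minimal Python sketch, assuming that records with missing values (NA's) are left out of the denominators; the helper name `share` is illustrative and not part of the original analysis.

```python
# Counts copied from the summary output (NA's excluded).
airbag_avail = {"no": 45087, "yes": 98839}
airbag_deploy = {"no": 78480, "yes": 51582}


def share(counts, level):
    """Return the percentage of non-missing records at the given level."""
    total = sum(counts.values())
    return 100 * counts[level] / total


avail_yes = share(airbag_avail, "yes")
deploy_no = share(airbag_deploy, "no")
print(f"airbag available:    {avail_yes:.1f}%")
print(f"airbag not deployed: {deploy_no:.1f}%")
```

Under that assumption, roughly 69% of the records with a known airbag status had an airbag available, and in about 60% of the records with a known deployment status the airbag did not deploy.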
This graph shows how often the airbags were deployed in cars that had airbags available.
The highest number of accidents occurred in state “6”.
This dataset is a list of all the fatal accidents which occurred in the US from 1998 to 2010.
The main feature of this dataset is that it shows the number of accidents that happened in each year from 1998 to 2010, by sex.
The cause of the accident and the time of occurrence would help in further investigation.
Yes, a subset of the dataset without the ‘NA’ values for ‘sex’ and ‘age’ was created, along with a subset containing only cars with airbags.
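The two subsets described here could be built along the following lines. This is a minimal Python sketch on a handful of made-up records, with `None` standing in for NA; the column names follow the summary output, but the rows are hypothetical, and it is an assumption that the airbag subset was taken from the full data rather than from the cleaned subset.

```python
# Hypothetical toy records; None stands in for NA.
records = [
    {"age": 23, "sex": 1, "D_airbagAvail": "yes"},
    {"age": None, "sex": 2, "D_airbagAvail": "no"},
    {"age": 41, "sex": None, "D_airbagAvail": "yes"},
    {"age": 67, "sex": 2, "D_airbagAvail": "no"},
    {"age": 35, "sex": 1, "D_airbagAvail": None},
]

# Subset 1: keep rows where both sex and age are present.
complete = [
    r for r in records
    if r["age"] is not None and r["sex"] is not None
]

# Subset 2: keep only cars equipped with an airbag.
with_airbag = [r for r in records if r["D_airbagAvail"] == "yes"]

print(len(complete), len(with_airbag))
```

Note that a missing airbag status is neither "yes" nor "no", so such rows drop out of the airbag subset automatically.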
There weren't any unusual distributions; the dataset was already clean.
A bar graph of accidents by age, for men only.
A bar graph of accidents by age, for women only.
Age vs Injury
Age vs Injury by Sex
A boxplot to compare Age vs Injury by Sex
A grid showing the states with the highest number of accidents.
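Picking the states for such a grid comes down to counting records per state code and keeping the most frequent ones. A minimal Python sketch, assuming one state code per accident record; the codes below are hypothetical, and keeping three states is an assumption made for illustration.

```python
from collections import Counter

# Hypothetical state codes, one per accident record.
states = [6, 48, 6, 12, 48, 6, 36, 12, 6, 48]

# Count records per state and keep the three most frequent.
top_states = Counter(states).most_common(3)
print(top_states)
```

Each entry in `top_states` is a `(state, count)` pair, ordered from the most to the least frequent state.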
A boxplot to compare Age vs Airbag Availability
In this part, the victim's injury level was compared against age. It did not vary much across the age groups.
Injury level was also compared between the sexes: males were prone to higher injury levels than females in the 20-50 age group.
People in the 75-85 age group suffered the highest injury levels.
In this graph, blue sits in the top portion and brown is spread across the bottom, i.e. among people older than 70, females are involved in more accidents than males throughout the years.
This is another, more detailed representation of the accidents for each sex in each year.
Number of accidents by Age group in Top 3 states
A boxplot to indicate the injury level in order
From this multivariate analysis, we observed that the majority of victims in the 70-90 age group were females, while in the 20-50 age group the majority were males.
There was nothing surprising or interesting.